consumer financial protection bureau
Performance of diverse evaluation metrics in NLP-based assessment and text generation of consumer complaints
Gao, Peiheng, Yang, Chen, Sun, Ning, Zitikis, Ričardas
Machine learning (ML) has significantly advanced text classification by enabling automated understanding and categorization of complex, unstructured textual data. However, accurately capturing nuanced linguistic patterns and contextual variations inherent in natural language, particularly within consumer complaints, remains a challenge. This study addresses these issues by incorporating human-experience-trained algorithms that effectively recognize subtle semantic differences crucial for assessing consumer relief eligibility. Furthermore, we propose integrating synthetic data generation methods that utilize expert evaluations of generative adversarial networks and are refined through expert annotations. By combining expert-trained classifiers with high-quality synthetic data, our research seeks to significantly enhance machine learning classifier performance, reduce dataset acquisition costs, and improve overall evaluation metrics and robustness in text classification tasks.
Distinguishing Scams and Fraud with Ensemble Learning
Chadalavada, Isha, Huang, Tianhui, Staddon, Jessica
Users increasingly query LLM-enabled web chatbots for help with scam defense. The Consumer Financial Protection Bureau's complaints database is a rich data source for evaluating LLM performance on user scam queries, but currently the corpus does not distinguish between scam and non-scam fraud. We developed an LLM ensemble approach to distinguishing scam and fraud CFPB complaints and describe initial findings regarding the strengths and weaknesses of LLMs in the scam defense context.
FICO scores leave out 'people on the margins,' Upstart's CEO says. Can AI make lending more inclusive -- without creating bias of its own?
Dave Girouard, the chief executive of the AI lending platform Upstart Holdings Inc. UPST, -2.51% in Silicon Valley, understood the worry. "The concern that the use of AI in credit decisioning could replicate or even amplify human bias is well-founded," he said in his testimony at the hearing. But Girouard, who co-founded Upstart in 2012, also said he had created the San Mateo, Calif.-based company to broaden access to affordable credit through "modern technology and data science." And he took aim at the shortcomings he sees in traditional credit scoring. The FICO score, introduced in 1989, has become "the default way banks judge a loan applicant," Girouard said in his testimony.
Using Machine Learning to Categorize Texts into Topics
After reading a news article -- whether the subject matter is U.S. politics, a movie review, or a productivity tip -- you can turn to someone else and give them a general idea of what it's about, right? Or if you read a novel, you can classify it as maybe sci-fi, literary fiction, or a romance. Humans tend to be pretty good at classifying texts. And these days, computers can do it, too. For a recent machine learning project, I downloaded consumer complaints from the Consumer Financial Protection Bureau and developed models to classify the complaints into one of five product categories.